Understanding the Performance of Low Power Raspberry Pi Cloud for Big Data
نویسندگان
چکیده
Abstract: Nowadays, Internet-of-Things (IoT) devices generate data at high speed and large volume. Often the data require real-time processing to support high system responsiveness which can be supported by localised Cloud and/or Fog computing paradigms. However, there are considerably large deployments of IoT such as sensor networks in remote areas where Internet connectivity is sparse, challenging the localised Cloud and/or Fog computing paradigms. With the advent of the Raspberry Pi, a credit card-sized single board computer, there is a great opportunity to construct low-cost, low-power portable cloud to support real-time data processing next to IoT deployments. In this paper, we extend our previous work on constructing Raspberry Pi Cloud to study its feasibility for real-time big data analytics under realistic application-level workload in both native and virtualised environments. We have extensively tested the performance of a single node Raspberry Pi 2 Model B with httperf and a cluster of 12 nodes with Apache Spark and HDFS (Hadoop Distributed File System). Our results have demonstrated that our portable cloud is useful for supporting real-time big data analytics. On the other hand, our results have also unveiled that overhead for CPU-bound workload in virtualised environment is surprisingly high, at 67.2%. We have found that, for big data applications, the virtualisation overhead is fractional for small jobs but becomes more significant for large jobs, up to 28.6%.
منابع مشابه
Feasibility of Raspberry Pi 2 based Micro Data Centers in Big Data Applications
Many new data centers have been built in recent years in order to keep up with the rising demand for server capacity. These data centers consume a lot of energy, need a lot of cooling equipment and occupy big stretches of land. Energy efficiency of data centers is becoming an increasingly hot topic. Researchers and companies continuously look for ways to bring down energy consumption. This pape...
متن کاملA Low-Cost Energy-Efficient Raspberry Pi Cluster for Data Mining Algorithms
Data mining algorithms are essential tools to extract information from the increasing number of large datasets, also called Big Data. However, these algorithms demand huge amounts of computing power to achieve reliable results. Although conventional High Performance Computing (HPC) platforms can deliver such performance, they are commonly expensive and power-hungry. This paper presents a study ...
متن کاملRaspberry Pi 2 as an Feasible Alternative for Cloud Based Parallel Computing Solutions
Data centres use about 250 350 TWh of electric energy per year. About 33% of the data centres power consumption comes from IT equipment. ARM devices are 3 to 4 times more efficient than the traditional x86 based devices [5]. In recent years, ARM processors have been used in small devices such as the Raspberry Pi [23]. The next generation, the Raspberry Pi 2 model B, has a higher clocked quad-co...
متن کاملCapabilities of Raspberry Pi 2 for Big Data and Video Streaming Applications in Data Centres
Many new data centres have been built in recent years in order to keep up with the rising demand for server capacity. These data centres require a lot of electrical energy and cooling. Big data and video streaming are two heavily used applications in data centres. This paper experimentally investigates the possibilities and benefits of using cheap, low power and widely supported hardware in the...
متن کاملDistributed Neural Networks for Internet of Things: The Big-Little Approach
Nowadays deep neural networks are widely used to accurately classify input data. An interesting application area is the Internet of Things (IoT), where a massive amount of sensor data has to be classified. The processing power of the cloud is attractive, however the variable latency imposes a major drawback for neural networks. In order to exploit the apparent trade-off between utilizing the av...
متن کامل